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Abstract. The Rubik's Cube is perhaps the world's most famous and 
iconic puzzle, well-known to have a rich underlying mathematical struc- 
ture (group theory). In this paper, we show that the Rubik's Cube also 
has a rich underlying algorithmic structure. Specifically, we show that 
the nxnxn Rubik's Cube, as well as the nxnxl variant, has a "God's 
Number" (diameter of the configuration space) of 0(n 2 / logn). The up- 
per bound comes from effectively parallelizing standard 0(n 2 ) solution 
algorithms, while the lower bound follows from a counting argument. 
The upper bound gives an asymptotically optimal algorithm for solving 
a general Rubik's Cube in the worst case. Given a specific starting state, 
we show how to find the shortest solution in an n x O(l) x O(l) Rubik's 
Cube. Finally, we show that finding this optimal solution becomes NP- 
hard inannxnxl Rubik's Cube when the positions and colors of some 
of the cubies are ignored (not used in determining whether the cube is 
solved) . 

Keywords: combinatorial puzzles, diameter, God's number, combina- 
torial optimization 

1 Introduction 

A little over thirty years ago, Hungarian architecture professor Erno Rubik re- 
leased his "Magic Cube" to the world^ What we now all know as the Rubik's 
Cube quickly became a sensation [2B]. It is the best-selling puzzle ever, at over 
350 million units [TS] . It is a tribute to elegant design, being part of the perma- 
nent collection of the Museum of Modern Art in New York [TH] . It is an icon for 
difficult puzzles — an intellectual Mount Everest. It is the heart of World Cube 
Association's speed-cubing competitions, whose current record holders can solve 
a cube in under 7 seconds (or 31 seconds blindfold) Q]. It is the basis for cube 
art, a form of pop art made from many carefully unsolved Rubik's Cubes. (For 

4 Similar puzzles were invented around the same time in the United States [9] |17|. the 
United Kingdom [6], and Japan [10] but did not reach the same level of success. 
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example, the recent movie Exit Through the Gift Shop features the street cube 
artist known as Space Invader.) It is the bane of many computers, which spent 
about 35 CPU years determining in 2010 that the best algorithm to solve the 
worst configuration requires exactly 20 moves — referred to as God's Number [22] . 

To a mathematician, or a student taking abstract algebra, the Rubik's Cube 
is a shining example of group theory. The configurations of the Rubik's Cube, 
or equivalcntly the transformations from one configuration to another, form a 
subgroup of a permutation group, generated by the basic twist moves. This 
perspective makes it easier to prove (and compute) that the configuration space 
falls into two connected components, according to the parity of the permutation 
on the cubies (the individual subcubes that make up the puzzle). See [7] for how 
to compute the number of elements in the group generated by the basic Rubik's 
Cube moves (or any set of permutations) in polynomial time. 

To a theoretical computer scientist, the Rubik's Cube and its many gen- 
eralizations suggest several natural open problems. What are good algorithms 
for solving a given Rubik's Cube puzzle? What is an optimal worst-case bound 
on the number of moves? What is the complexity of optimizing the number of 
moves required for a given starting configuration? Although God's Number is 
known to be 20 for the 3x3x3, the optimal solution of each configuration in 
this constant-size puzzle still has not been computed |22| : even writing down 
the first move in each solution would take about 8 exabytes (after factoring out 
symmetries) . While computing the exact behavior for larger cubes is out of the 
question, how does the worst-case number of moves and complexity scale with 
the side lengths of the cube? In parallel with our work, these questions were 
recently posed by Andy Drucker and Jeff Erickson 0]. Scalability is important 
given the commercially available 4x4x4 Rubik's Revenge [35] ; 5 x 5 x 5 Profes- 
sor's Cube [13]; the 6 x 6 x 6 and 7 x 7 x 7 V-CUBEs [28] whose design enables 
cubes as large as 11 x 11 x 11 according to Verdes's design patent [30]; Leslie Le's 
3D-printed 12 x 12 x 12 [13]: and Oskar van Deventer's 17 x 17 x 17 Over the Top 
and his 2 x 2 x 20 Overlap Cube, both available from 3D printer shapeways [2"5] . 

Diameter / God's Number. The diameter of the configuration space of a Rubik's 
Cube seems difficult to capture using just group theory. In general, a set of 
permutations (moves) can generate a group with superpolynomial diameter [3]. 
If we restrict each generator (move) to manipulate only k elements, then the 
diameter is 0(n k ) [16], but this gives very weak (superexponential) upper bounds 
for n x n x n and n x n x 1 Rubik's Cubes. 

Fortunately, we show that the general approach taken by folk algorithms for 
solving Rubik's Cubes of various fixed sizes can be generalized to perform a 
constant number of moves per cubie, for an upper bound of 0(n 2 ). This result 
is essentially standard, but we take care to ensure that all cases can indeed be 
handled. 

Surprisingly, this bound is not optimal. Each twist move in the nx nx n and 
nxnxl Rubik's Cubes simultaneously transforms n e ^> cubies (with the expo- 
nent depending on the dimensions and whether a move transforms a plane or a 
half-space) . This property offers a form of parallelism for solving multiple cubies 
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at once, to the extent that multiple cubies want the same move to be applied at 
a particular time. We show that this parallelism can be exploited to reduce the 
number of moves by a logarithmic factor, to 0(n 2 / log n). Furthermore, an easy 
counting argument shows an average-case lower bound of J2(n 2 / logn), because 
the number of configurations is 2 <s> (™ ' and there are 0(n) possible moves from 
each configuration. 

Thus we settle the diameter of the nxnxn and nxnxl Rubik's Cubes, up 
to constant factors. These results are described in Sections [4] and [3j respectively. 

n 2 — 1 puzzle. Another puzzle that can be described as a permutation group given 
by generators corresponding to valid moves is the n x n generalization of the 
classic Fifteen Puzzle. This n 2 — 1 puzzle also has polynomial diameter, though 
lacking any form of parallelism, the diameter is simply 0(n 3 ) |20j . Interestingly, 
however, computing the shortest solution from a given configuration of the puzzle 
is NP-complete [5T]. More generally, given a set of generator permutations, it 
is PSPACE-complete to find the shortest sequence of generators whose product 
is a given target permutation |5lllj . These papers mention the Rubik's Cube 
as motivation, but neither address the natural question: is it NP-complete to 
solve a given nxnxn or nxnxl Rubik's Cube using the fewest possible 
moves? Although the nxnxn problem was posed as early as 1984 |2l21j . both 
questions remain open [12) . We give partial progress toward hardness, as well as 
a polynomial-time exact algorithm for a particular generalization of the Rubik's 
Cube. 

Optimization algorithms. We give one positive and one negative result about 
finding the shortest solution from a given configuration of a generalized Rubik's 
Cube puzzle. On the positive side, we show in Section [6] how to compute the ex- 
act optimum for n x 0(1) x 0(1) Rubik's Cubes. Essentially, we prove structural 
results about how an optimal solution decomposes into moves in the long dimen- 
sion and the two short dimensions, and use this structure to obtain a dynamic 
program. This result may prove useful for optimally solving configurations of 
Oskar van Deventer's 2x2x20 Overlap Cube , but it does not apply to the 
3x3x3 Rubik's Cube because we need n to be distinct from the other two side 
lengths. On the negative side, we prove in Section [5] that it is NP-hard to find 
an optimal solution to a subset of cubies in an n x n x 1 Rubik's Cube. Phrased 
differently, optimally solving a given nxnxl Rubik's Cube configuration is 
NP-hard when the colors and positions of some cubies are ignored (i.e., they are 
not considered in determining whether the cube is solved). 

2 Common Definitions 

We begin with some terminology. An £ x m x n Rubik's Cube is composed of 
Imn cubies, each of which has some position (x, y, z), where x € {0, 1, ... ,1— 1}, 
j/G {0, 1, ... , to— 1}, and z £ {0,1, ... , n— 1}. Each cubie also has an orientation. 
Each cubie in a Rubik's Cube has a color on each visible face. There are six colors 
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in total. We say that a Rubik's Cube is solved when each face of the cube is the 
same color, unique for each face. 

A slice of a Rubik's Cube is a set of cubies that match in one coordinate 
(e.g. all of the cubies such that y = 1). A legal move on a Rubik's Cube involves 
rotating one slice around its perpendiculaij^] In order to preserve the shape 
of the cube, there are restrictions on how much the slice can be rotated. If 
the slice to be rotated is a square, then the slice can be rotated 90° in either 
direction. Otherwise, the slice can only be rotated by 180°. Finally, note that if 
one dimension of the cube has length 1, we disallow rotations of the only slice in 
that dimension. For example, we cannot rotate the slice z = in the hxuxI 
cube. 

A configuration of a Rubik's Cube is a mapping from each visible face of each 
cubie to a color. A reachable configuration of a Rubik's Cube is a configuration 
which can be reached from a solved Rubik's Cube via a sequence of legal moves. 

For each of the Rubik's Cube variants we consider, we will define the contents 
of a cubie cluster. The cubies which belong in this cubie cluster depend on the 
problem we are working on; however, they do share some key properties: 

1. Each cubie cluster consists of a constant number of cubies. 

2. No sequence of legal moves can cause any cubie to move from one cubie 
cluster into another. 

Each cubie cluster has a cluster configuration mapping from each visible face of 
the cubie cluster to its color. Because the number of cubies in a cubie cluster is 
constant, the number of possible cluster configurations is also constant. 

We say that a move affects a cubie cluster if the move causes at least one 
cubie in the cubie cluster to change places. Similarly, we say that a sequence of 
moves affects a cubie cluster if at least one cubie in the cubie cluster will change 
position or orientation after the sequence of moves has been performed. 



3 Diameter of n x n X 1 Rubik's Cube 

When considering an n x n x 1 Rubik's Cube we omit the third coordinate of a 
cubie, which by necessity must be 0. For simplicity, we restrict the set of solutions 
to those configurations where the top of the cube is red and the bottom of the 
cube is blue. 

Consider the set of locations that a cubie at position (x,y) can reach. If we 
flip column x, the cubie will move to position (x, n — y — 1) . If we instead flip row 
y, the cubie will move to position (n — x — 1, y). Consequently, there are at most 
four reachable locations for a cubie that starts at (x,y): (x,y), (x,n — y — 1), 
(n ~ x — l,y), and (n — x — l,n — y — 1). We call this set of locations the cubie 
cluster (x, y). 

5 While other reasonable definitions of a legal move exist (e.g. rotating a set of con- 
tiguous parallel slices), this move definition most closely matches the definition used 
in popular move notations. 
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If a cubie is in the first or last row or column, we call it an edge cubie. If a 
cluster contains an edge cubie, then we call it an edge cluster, because all of its 
cubies are edge cubies. The special edge cluster which contains the cubie in the 
first row and first column is called the corner cluster, and its cubies are corner 
cubies. If n is odd, then there is a cluster with one cubie which is in both the 
median row and the median column. We will call this cubie the center cubie or 
center cluster. In addition, if n is odd then there are also clusters with two cubies 
found in the median row or column. We call the clusters cross clusters and the 
cubies in them cross cubies. 

We begin by showing that for any reachable cluster configuration, there exists 
a sequence of moves of constant length which can be used to solve that cluster 
without affecting any other clusters. 

In the remainder of Section [3j we use the notation Hi , H 2 and V\ , V% to 
denote the two rows and columns containing cubies from a single cubie cluster. 
We also use the same symbols to denote single moves affecting these rows and 
columns. Recall that for a n x n x 1 cube there is only one valid non-identity 
operation on a row or column: rotation by 180°. In the special cases of cross and 
center cubie clusters, we denote the single row or column containing the cluster 
by H or V, respectively. 

We begin by proving the following lemma about cubie cluster configurations. 



Lemma 1. In a solvable n x n x 1 Rubik's Cube, the colors on the top faces of 
the cubies in a cubie cluster can only be in the six configurations in Fig. [7] 

Proof. Consider what happens to a cubie cluster when a move is performed. If 
the move does not affect the cubie cluster, then its cubie configuration will not 
change. Otherwise, the move will swap two cubies in the same row or column 
while reversing the color of each cubie. If both cubies are the same color, then 
both cubies will become the other color. In other words, if one cubie is red and 
the other is blue, then the color configuration will not change. 



Figure 1(a) shows the solved configuration, which is quite obviously reach- 



able. The four moves which affect this cubie cluster result in configurations 1(b) 
1(c) l(d)| an d 1(e) Consider how the four possible moves affect each of configu- 



rations [1(b) 1(c) , 1(d) and 1(e) For each configuration, two of the four possible 
moves involve one red cubie and one blue cubie, and therefore do not affect the 
colors. In addition, one move for each configuration is the inverse of the move 



used to reach that configuration, and therefore leads back to configuration 1(a) 



The final move for each configuration results in configuration 1(f) thus complet- 
ing the set of reachable configurations. □ 

Because of these limitations on cubie cluster configurations, we can prove the 
following lemma. 

Lemma 2. All six cluster configurations from Lemma^ can be solved using a 
sequence of at most six moves that does not affect the position of any cubies not 
in the cubie cluster. (See Fig. [TJj 
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Vi V 2 



Vi v 2 



Vi v 2 





(b) V u Hi,Vi,Hi. 



Vx V 2 





(c) V 2 ,H 1 ,V 2 ,H 1 . 



V l V 2 




(d) Hi,Vi,Hi,Vi. (e) H 2 ,Vi,H 2 ,Vx. (f) H 1 ,H 2 ,Vi,Hi,H 2 ,Vi. 

Fig. 1. The reachable cluster configurations and the move sequences to solve them. 



Proof. The correct move sequence for each configuration from Lemma [T] is given 
in Fig. [I] The fact that we always use each move twice ensures that all other 
clusters will not be affected by the move sequence. The correctness of these 
sequences can be verified by the reader. □ 

In order to handle the edge, corner, and cross clusters, we need a more 
complicated sequence of moves. These clusters cannot always be solved without 
affecting any other cubes. So rather than show that we can solve each cluster 
individually, we show that we can solve all such clusters together. 

Lemma 3. Given a solvable configuration of an n x n x 1 Rubik's Cube, there 
exists a sequence of moves of length O(n) which can be used to solve the edge 
clusters and cross clusters. 

Proof. We begin by solving the corner cluster, and, if n is odd, the center cluster 
and the two edge cross clusters. These four clusters combined have only O(l) 
reachable configurations, and so all four can be obviously be fixed in O(l) moves, 
disregarding the effect that these moves might have on any other clusters. Next 
we solve the other edge clusters. Our goal is to solve each cluster without affecting 
any of the clusters we have previously solved. 

Without loss of generality, say that we are trying to solve an edge cluster 
with coordinates (x, 0). We begin by using the move sequences from Lemma [5] 
to make sure that the cluster has all four red stickers visible. Then it will be in 
one of the states depicted in Fig. [2j To solve the cluster, we can use the move 
sequences given in Fig. [2j Although the given move sequences are not guaranteed 
to apply the identity permutation to all other clusters, they do have the property 
that any horizontal move will be performed an even number of times. Hence, this 
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move sequence will apply the identity permutation to all other edge clusters. In 
addition, none of the move sequences affect the center cubie. 



Vi V 2 



(a) Solved. 



Hi 



H-2 



Vi V 2 



Hi 



H 



Hi 



H 2 



Vi V 2 



(b) Hi,Vi,H x . 



{c)Hi,V 2 ,Hi. (d) Hi,Vi,V 2 ,H x . 



Fig. 2. When all four red stickers are facing forwards, these are the possible configu- 
rations for an edge cluster, and the move sequences used to solve them. 



Once all the edge clusters have been solved, we want to solve the cross clus- 
ters. We know that the center cluster has already been solved. We also know that 
there are 0{n) cross clusters, so if we can solve each cross cluster in O(l) moves 
without affecting the rest of the clusters, then we will have solved the edge and 
cross clusters in a total of 0(n) moves. Without loss of generality, say that we 
are trying to solve the cross cluster ((n — l)/2, y). The four possible states for a 
cross cluster are depicted in Fig. [3] 



V V V V 



Hi 








Hi 




Hi 




H 2 


" 


H 2 


" 


H 2 


" 


H 2 


" 



(a) Solved. (b) n/a. (c) n/a. (d) Hi, V, Hi, V. 



Fig. 3. The four reachable configurations for a cross cluster. Two of them can be 
solved without affecting the rest of the clusters. 



However, because we have already solved all of the edge clusters, we know 
that the set of possible configurations for our cross cluster is more restricted. 
Both horizontal moves affecting our cross cluster will cause one of the two cubies 
to change color. No matter what state the cross cluster is in, the vertical move 
cannot change the color of only one cubie. Therefore, if the cross cluster is in 
the configuration depicted in Fig. |3(b)| or the configuration depicted in Fig. 3(c) 



then the rest of the solution must perform either the move Hi or the move Hi 
an odd number of times. 

Consider the effect of this on the edge cluster affected by H\ and Hi. Each 
move in the edge cluster causes a swap of two cubies. If an order is placed on 
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the cubies in the cluster, each swap is a permutation of this order, and the 
set of permutations constructible using swaps is equivalent to the permutation 
group on 4 elements, S4. By permutation group theory, if a particular cluster 
configuration can be solved using an even number of swaps, then any solution 
for that cluster configuration has an even number of swaps. We know that the 
edge cluster is already solved, so it can be solved using an even number of swaps. 
So if the rest of the solution contains an odd number of moves H\ and H2 , then 
it must also contain an odd number of edge moves V± and V^. 



H 



WEB — ■ WEB — ■ ( d )EH — ■ 



Vi 




Vi 



V2 




(°) UM — m ( f )M — ■ te)m — ■ ( h )EB- 

H 



V 2 



Fig. 4. The possible configurations of an edge cross cluster. 



Now consider the effect of an odd number of edge moves on the affected 
edge cross cluster. Figure [4] gives the configuration space for that cluster. We 
know that it is currently in the solved state labelled (a) . Any sequence of moves 
which contains an odd number of edge moves V± and V2 will cause the edge cross 
cluster to leave the solved state. So the rest of the solution cannot contain an 
odd number of edge moves V\ and V2. This means that the rest of the solution 
cannot contain an odd number of horizontal moves H± and H2 affecting a single 
cross cluster. So every cross cluster must be in one of the states depicted in 
Figs. 3(a) and |3(d)| each of which can be solved without affecting any other 
clusters using the sequence of moves listed for each state. □ 



3.1 n X n X 1 Upper Bound 

There are n 2 clusters in the nxnxl Rubik's Cube. If we use the move sequences 
given in Fig. [l]to solve each cluster individually, we have a sequence of 0(n 2 ) 
moves for solving the entire cube. In this section, we take this sequence of moves 
and take advantage of parallelism to get a solution with 0(n 2 / log n) moves. 

Say that we are given columns X and rows Y such that all of the cubie clusters 
(x, y) G X x Y are in the configuration depicted in Fig. l(b)| If we attempted 
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to solve each of these clusters individually, the number of moves required would 
beO(\X\-\Y\). 

Consider instead what would happen if we first flipped all of the columns 
x € X, then flipped all of the rows y £ Y, then flipped all of the columns x G X 
again, and finally flipped all of the rows y £ Y again. What would be the effect 
of this move sequence on a particular (x* , y*) G X x Y1 The only moves affecting 
that cluster are the column moves x* and (n — 1 — x*) and the row moves y* 
and (n— 1 — y*). So the subsequence of moves affecting (x* , y*) would consist of 
the column move x* , followed by the row move y* , followed by the column move 
x* again, and finally the row move y* again. Those four moves are exactly the 
moves needed to solve that cluster. 

This idea allows us to parallelize the solutions for multiple clusters, resulting 
in the following lemma. 

Lemma 4. Given an n x n x 1 Rubik's Cube configuration and sets X, Y C 
{0, 1, . . . , \ n/2\ — 1}, if all cubie clusters (x, y) G X x Y are in the same cluster 
configurations, then all cubie clusters (x,y) G X xY can be solved in 0(\X\ + \Y\) 
moves without affecting the rest of the cubies. 

Proof. By Lemma [2j we know that there exists a sequence of moves of length at 
most six which will solve the configuration of a single cubie cluster (x, y) G XxY. 
We write this sequence of moves in a general form as in Lemma [2] We then 
replace each move V\ with a sequence of moves that flips each column x G X. 
Similarly, we replace each move V2 with a sequence of moves that flips each 
column n — x — 1, where x G X. We perform similar substitutions for H± and 
H2, using the rows y and n — y — 1 instead. Because the original sequence of 
moves had length at most six, it is easy to see that the length of the new move 
sequence is 0(|X| + \Y\). 



n 



(a) (b) (c) (d) (e) 

Fig. 5. Solving four cubie clusters at the same time. 



We claim that this new sequence of moves will solve all cubie clusters (x, y) G 
XxY, and that no other cubie clusters will be affected. To see that this is true, 
consider how some cubie cluster (x*, y*) G {0, 1, . . . , \ n/2\— l}x{0, 1, . . . , \ n/2j — 
1} will be affected by the move sequence. If x* ^ X and y* ^ Y, then none of the 
moves in the sequence will affect the position of any cubies in the cubie cluster, 
and therefore the cubie cluster will be unaffected. If x* G X and y* G Y, then 
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the subsequence of moves which affect this cubie cluster will be exactly the se- 
quence of moves necessary to solve this cubie cluster. Otherwise, either x* £ X 
or y* £ Y, but not both. Therefore, the subsequence of moves which affect this 
cubie cluster will be either all vertical, or all horizontal, and each move will occur 
exactly twice. This means that the subsequence of moves which affect this cubie 
cluster will apply the identity permutation to this cubie cluster. □ 

This technique allows us to solve all cubie clusters (x, y) £ X x Y using only 
0(|AT| + \Y\) moves, if each one of those clusters is in the same configuration. 
Our goal is to use this technique for a related problem: solving all of the cubie 
clusters [x,y) £ X x Y that are in a particular cluster configuration c, leaving 
the rest of the clusters alone. To that end, we divide up the columns X according 
to the pattern of rows that are in configuration c, and solve each subset of the 
rows using the technique of Lemma [4] More formally: 

Lemma 5. Suppose we are given an n x n x 1 Rubik's Cube configuration, a 
cluster configuration c, and sets X,Y C {0, 1, . . . , [n/2\ — 1} such that \X\ = £. 
Then all cubie clusters (x, y) £ X x Y that are in configuration c can be solved 
in 0(12 + \Y\) moves without affecting the rest of the cubies. 

Proof. For each y £ Y, we define S y — {x £ X | the cubie cluster (x,y) is in 
configuration c}. For each S C X, we let Ys = {y £ Y \ S y = S}. There are 
2 l possible values for S. For each Ys, we use the sequence of moves which is 
guaranteed to exist by Lemma [4] to solve all (x,y) £ S x Yg. This sequence of 
moves has length 0(\S\ + \ Y S \) = 0(1 + \ Yg\). When we sum the lengths up for 
all Ys, we find that the number of moves is bounded by 

oU-2 l + Y, \Ys\^j =0(£-2 e + \Y\) . 

□ 

Unfortunately, this result is not cost-effective. We need to make sure that 
£ is small enough to prevent an exponential blowup in the cost of solving the 
Rubik's Cube. To that end, we divide up the columns into small groups to get 
the following result: 

Lemma 6. Suppose we are given an n x n x 1 Rubik's Cube configuration, a 
cluster configuration c, and sets X, Y C {0,1,..., \n/2\ — 1}. Then all cubie 
clusters (x,y) £ X x Y that are in configuration c can be solved in 0(\X\ ■ 
\Y\/ log \Y\) moves without affecting the rest of the cubies. 

Proof. Let £ = \ log 2 \Y\, so that 2 l = \f\Y~\. Let k = \\X\/£~\. Partition the set 
X into a series of sets X\ , . . . , X^ each of which has size < £. For each Xi , we 
solve the cubie clusters in Xi x Y using the sequence of moves that is guaranteed 
to exist by Lemma [5j The number of moves required to solve a single Xj, is 

O (£2 l + \Y\) =o((\ log 2 \Y\\ v/jy| + \Y\\ = 0(\Y\). 
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Therefore, if we wish to perform this for k sets, the total number of moves 
becomes: 



0{k-\Y\) = T -L-i_.|y| =0 



|io g2 |y| 1 7 V login 



□ 



As a result of this parallelization, we get the following upper bound on the 
diameter of the configuration space: 

Theorem 1. Given an n x n x 1 Rubik's Cube configuration, all cubic clusters 
can be solved in 0(n 2 / log n) moves. 

Proof. In order to solve the Rubik's Cube, we must solve all cubie clusters 
(x,y) E {0,1,..., |_n/2j-l}x{0,l,..., [n/2\ -1}. To do so, we loop through the 
six possible cluster configurations, then use the sequence of moves guaranteed to 
exist by Lemma [6] to solve all of the cubie clusters which are in a particular con- 
figuration. For a single configuration, the number of moves that this generates 
is 

f [n/2\-[n/2\ \ f n 2 \ 

^ log(|n/2|) J \hgnj- 

When we add this cost up for the six different configurations, the total number 
of moves is 6 • 0(n 2 / log n) = 0(n 2 / log n). □ 



3.2 n X n X 1 Lower Bound 

In this section, we establish the matching lower bound: 

Theorem 2. Some configurations of an nxnxl Rubik 's Cube are f2(n 2 / log n) 
moves away from being solved. 

Proof. Lemma [2] shows that for every possible configuration of a cubie cluster, 
there exists a sequence of moves to solve the cubie cluster while leaving the rest 
of the cubies in the same location. Hence, the inverse of such a sequence will 
transform a solved cubie cluster to an arbitrary cluster configuration without af- 
fecting any other cubies. Not counting the edge cubies and the cross cubies, there 
are (L n /2J — l) 2 cubie clusters, each of which can be independently placed into 
one of six different configurations. This means that there are at least Q^ 71 / 2 ^ 1 ) 
reachable configurations. 

There are 2n possible moves. Therefore, the total number of states reachable 
using at most k moves is at most 

- 1 < (2nrK 
2n - 1 ~ v ' 
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Therefore, if k is the number of moves necessary to reach all states, it must have 
the property that: 

6 ( L n/2j-l) 2 < ( 2 „)*+l, 

( L n/2j-lf<lo g6 ((2n)^) = ( fc+ ^ g 1 6 ° s2 - ! 

(Ln/2j -l) 2 log6 l<k 
log2n 

Hence, there must exist some configurations which are Q(n 2 / \ogn) moves away 
from solved. □ 



4 Diameter of n x n x n Rubik's Cube 

Because the only visible cubies on the nxnxn Rubik's Cube are on the surface, 
we use an alternative coordinate system. Each cubie has a face coordinate (x, y) £ 
{0, 1, . . . , n — 1} X {0, 1, . . . , n — 1}. Consider the set of reachable locations for a 
cubie with coordinates (x,y) £ {0, 1, . . . , n — 1} x {0, 1, . . . , n — 1} on the front 
face. A face rotation of the front face will let it reach the coordinates (n—y—1, x), 
(n — x — l,n— y— 1), and (y,n—x—l) on the front face. Row or column moves will 
allow the cubie to move to another face, where it still has to have one of those 
four coordinates. Hence, it can reach 24 locations in total. For this problem, we 
define the cubie cluster (x,y) to be those 24 positions that are reachable by the 
cubie (x, y). 

We define edge cubies to be cubies with more than one face visible. If a cluster 
has an edge cubie, then all of its cubies are edge cubies. We call such clusters edge 
clusters. We define corner cubies to be cubies with more than two faces visible. 
All corner cubies are in a single cluster known as the corner cluster. If n is odd, 
we must also define several other types of cubies. We first define cross cubies 
to be cubies with face coordinates of the form (x, (n — l)/2) or ((n — l)/2,y). 
If a cluster contains a cross cubie, then all of its cubies are cross cubies, and 
the cluster is called a cross cluster. We define center cubies to be the six cubies 
with face coordinates ((n — l)/2, (n — l)/2). They form a special cluster which 
we will call the center cluster. Our goal in solving the Rubik's Cube will be to 
make each side match the color of its center cluster. Hence, there is no need to 
solve the center cluster. 

Given a particular cluster configuration, this configuration can be converted 
to the solved color configuration by performing a sequence of pairwise cubie 
swaps. If an order is placed on the cubies in the cluster, as in Figure [6j each 
pairwise cubie swap is a permutation of this order, and the set of all cubies 
swaps generates the permutation group on 24 elements, S24 . By permutation 
group theory, if an even (odd) number of swaps can be applied to a color con- 
figuration to transform it to the solved color configuration, then any sequence 
of swaps transforming a configuration to the solved configuration has an even 
(odd) number of swaps. We call this the parity of a color configuration. 
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Just as it was for the bxjixI cube, our goal is to prove that for each cluster 
configuration, there is a sequence of 0(1) moves which can be used to solve the 
cluster, while not affecting any other clusters. For the n x n x 1 cube, we wrote 
these solution sequences using the symbols Hi,H 2 , Vi, V2 to represent a general 
class of moves, each of which could be mapped to a specific move once the cubie 
cluster coordinates were known. Here we introduce more formal notation. 

Because of the coordinate system we are using, we distinguish two types of 
legal moves. Face moves involve taking a single face and rotating it 90° in either 
direction. Row or column moves involve taking a slice of the cube (not one of its 
faces) and rotating the cubies in that slice by 90° in either direction. Face moves 
come in twelve types, two for each face. For our purposes, we will add a thirteenth 
type which applies the identity function. If a is the type of face move, we write 
F a to denote the move itself. Given a particular index i E {1,2,..., [n/2\ — 1}, 
there are twelve types of row and column moves that can be performed — three 
different axes for the slice, two different indices (i and n — i — 1) to pick from, 
and two directions of rotation. Again, we add a thirteenth type which applies 
the identity function. If a is the type of row or column move, and i is the index, 
then we write RC a .i to denote the move itself. 

A cluster move sequence consists of three type sequences: face types a\, . . . , ae, 
row and column types b\, . . . , be, and row and column types c\, . . . , C£. For a cu- 
bie cluster (x,y), the sequence of actual moves produced by the cluster move 
sequence is F ai , RCb ljX , RC Cl . y , . . . , F ae , RC 0UX , RC C(iV . A cluster move solution 
for a cluster configuration d is a cluster move sequence with the following prop- 
erties: 



1. For any (x,y) G {1,2,..., |n/2j - 1} x {1, 2, . . . , |n/2j - 1}, if cluster (x,y) 
is in configuration d, then it can be solved using the sequence of moves 

2. The move sequence F ai , RCb ltX , RC Cl ^ y , . . . , F ae , RCb e , x , RC Ct , y does not af- 
fect cubie cluster (y,x). 

3. All three of the following sequences of moves do not affect the configuration 
of any cubie clusters: 

Fai ! ^^61 .x ? F a2 , RC 0l ?a; , . . . , F a ^ , RCb^ x , 
Fai 5 RCci ,yi F a2 ; RCci ,yi • • • ; F af , RC C £,y', 
Fai 1 F a2 1 ■ ■ ■ j Fai 1 

Our goal is to construct a cluster move solution for each possible cubie cluster 
configuration, and then use the cluster move solution to solve multiple cubie 
clusters in parallel. 

In the speed cubing community, there is a well-known technique for solving 
n x n x n Rubik's Cubes in 0(n 2 ) moves, involving a family of constant-length 
cluster move sequences. These cluster move sequences can be combined to con- 
struct constant-length cluster move solutions for all possible cluster configura- 
tions, which is precisely what we wanted. 
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Before dealing with the general solution, we address fixing the parity of the 
cubie clusters. This allows us to assume that the parity of all clusters is even for 
the remainder of the paper. 




Fig. 6. An ordering of a 24-cubie cluster. 



Lemma 7. Given a solvable nxnxn Rubik 's Cube configuration, the parity of 
all cubie clusters can be made even in 0(n) moves. 

Proof. By definition, the center cluster is already solved, and therefore we may 
assume that its parity is already even. In addition, any cluster containing at 
least two indistinguishable cubies can be considered to have even parity or odd 
parity depending on the chosen label for the indistinguishable cubies. Therefore, 
we may assume that all such clusters have even parity. This means that all non- 
edge clusters, including the non-edge cross clusters, can be assumed to have the 
correct parity no matter how many moves are performed. So we need only fix 
the parity of the edge clusters. 

We begin by fixing the parity of the corner cluster and the edge cross cluster 
(if it exists). Because the cube is solvable, we know that the corner cluster and the 
edge cross cluster can be solved. Because the corner cluster has 0(1) reachable 
states and the edge cross cluster has 0{1) reachable states, we know that we can 
solve both in 0(1) moves. Once those two clusters are solved, we know that their 
parities must be correct. Therefore, there is a sequence of 0(1) moves which can 
be used to fix the parity of those clusters. 

Consider the effect of a face move on the parity of a non-cross edge cluster. 
For a particular edge cluster, a face move affects the location of eight cubies, 
due to the fact that a face move also acts like a row or column move for edge 
cubie groups. The color of each cubie is rotated 90° in the direction of the face's 
rotation. This means that the permutation applied consists of two permutation 
cycles each containing four elements. Therefore, if the elements whose colors 
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are changed are 1,2, 3,..., 8, then we can write the applied permutation as 
(1 3)(1 5)(1 7) (2 4) (2 6) (2 8), or six swaps. Hence face moves cannot be used to 
fix the parity of the edge clusters. 

Now consider the effect of a row or column move on the parity of a non-cross 
edge cluster. A row or column move affects the colors of four cubies, one for each 
corner of the rotated slice. The color of each cubie is transferred to the adjacent 
cubie in the direction of the move rotation. So if the elements whose colors are 
changed are 1, 2, 3, 4, then the applied permutation is (1 2 3 4) = (1 2)(1 3)(1 4). 
Because the permutation can be written as an odd number of swaps, the parity 
of the cluster has changed. Note, however, that there is exactly one edge cluster 
whose parity is affected by this movement. Therefore, we can correct the parity of 
each odd edge cluster by performing a single row or column move that affects the 
cluster in question. The total number of moves required is therefore proportional 
to the number of edge clusters, or 0(n). □ 

Lemma 8. For each permutation in Table [7| there exists a cluster move se- 
quence of length 0(1) which can be used to apply the given permutation to the 
cluster while applying the identity permutation to every other cluster. 

Proof. First, we introduce some more notation for Rubik's Cube moves. It is 
more specific than the formalism of the cluster move sequence introduced above, 
making it easier to express a particular set of moves, but is more difficult to 
analyze when we parallelize this move sequence. 

Consider facing the cube from in front (the front face is the face in the 
xz-plane with the more negative y- value). From this view, there are horizontal 
moves that rotate a slice of the cube parallel to the xy-plane. Rotating the zth 
slice from the top in the clockwise direction 90° as viewed from above the cube is 
denoted by Hf w . Rotating this same slice in the opposite direction is denoted by 
jjccw _ Similarly, rotating a slice parallel to the yz-plane is a vertical move, and 
rotating the jth slice from the left side of the cube in the clockwise direction 
as viewed from left of the cube is denoted Vj , while the counter-clockwise 
version is denote V^ cw . Finally, we define a third type of move which rotates a 
slice parallel to the front face. Counting inward from the front face, we denote 
rotating the fcth slab 90° clockwise as D^ w , while rotating it 90° in the opposite 
direction is D% cw . See Figure [TJ 

We claim that the move sequence S = V£ cw o Dg w o V™ o Dg cw o H° w o 
Dg w o V£ cw o Dg cw o V° w o H§ cw swaps the colors of three cubies in a single 
cluster, while leaving the color configurations of all other clusters the same. This 
move sequence is attributed to Ingo Schiitze [24) , but is fairly well-known within 
the speed cubing community, so its origins are unclear. The effect of applying S 
is shown by case analysis of individual cubies. When applying the sequence S, 
only cubies lying on the union of the slices rotated by the V n , D n , and Hq moves 
are affected. 

Blocks lying on the bottom face are unaffected, as they never reach the top 
face and thus only have the subsequence Vf cw o D% w o V° w o Dg cw o Dg w o 
Vn ° D^ l cw o Vf^ w applied, which does not affect the final location of a 
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Fig. 7. The definitions of the various moves defined on an n x n x n Rubik's Cube. 



cube. Blocks starting on the back and right faces never move to the upper face, 
as each move that could place these on the upper face (moves V^ 1 w and Dg cw ) 
is preceded by a move rotating these cubies onto the bottom face (moves V£ cw 
and Dg w ). 

Now consider the cubies on left face in the slice rotated by Dg^ and Dg cw . 
Exactly one of these cubies is in the same cluster as the cubie on the upper face 
that lies in the slices rotated by both V^ w and fl^ ff . All other such cubies 
cannot be affected by Dg^ and Dg cw moves, so applying S has the same 
effect as applying the sequence Dg w o Dg cw o H§ w o Dg w o Dg cw o H§ cw . 
Canceling the Dg w and Dg cw moves yields H§ w ', H§ cw and thus these cubies 
are unaffected. Now consider the single cubie on the left face in the same cluster 
as the cubie on the upper face that lies in the slices rotated by both Vfg w and 
Dg^ . Tracing the locations visited by this cubie when S is applied to it shows 
that the cubie travels to the upper face (via Dg^), the front face (via L^ w ), 
the upper face (via V^ cw ), and then the left face (via D ( ^ JW ). So the cubie's 
location is unaffected by S. 

Next consider the cubies on the front face. Exactly one of these cubies is in 
the same cluster as the cubie on the upper face that lies in the slices rotated 
by both V£ w and D^ w . All other such cubies cannot be affected by V% w 
and V£ CW moves, so applying S has the same effect as applying the sequence 
Dg w o Dg cw o Hg w o Dg w o Dg cw o H § cw . For the cubie in the same cluster 
as the cubie on the upper face that lies on both the V^ w and Dg^ slices, 
applying S to it results in moving it, in sequence, to the left side of the upper 
face (Yn CW ), r ig n t f ace (Dg^), upper face (Dg cw ), the back side of the upper 
face (Hq W ), and the left side of the upper face (Hq CW ). So applying S to this 
cubic moves it to the location of the cubie in its cluster in the left side of the 
upper face. 

Finally, consider the cubies on the upper face. Divide the cubies into three 
sets based upon the cubies in their clusters. Each cubie on the upper face either 
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is in the cluster containing the cubie lying in slices rotated by both V^ w and 
' , or is in a cluster containing a cubie lying in exactly one of the slices 
rotated by V^ w and , or is in a cluster that does not contain any elements 
in either of the slices rotated by V£ and fl^ ff . If a cubie lies in a cluster that 
does not contain any elements in either of the slices rotated by V n and D^, 
then it cannot be affected by V^ w , V% cw ', D° w , or Dg cw moves. So applying 
S to it is equivalent to applying Hq W ,Hq CW to it, and thus does not affect 
its position. If a cubie lies in the V^ w slice but is not in the cluster containing 
the cubie in both V^ W and D 1 ^ slices, then applying S to it is equivalent to 
applying V° cw o V° w o Hg w o V° cw o V™ o ff™ (the identity) as it can 
never lie in the slice. Similarly, if a cubie lies in the D„ slice but is not 

in the cluster containing the cubie in both V^ w and slices, then it can 

never lie in the w slice, and so applying S to it is equivalent to applying 
Dg w o Dg cw oH° w o Dg w o Dg cw oH§ cw , the identity. 

Now consider the four cubies on the upper face in the same cluster as the 
cubie lying in both V„ and slices. The cubie lying on the left side of the 

upper face is the cubie lying in both the V£ W and slices, and applying 

S to it results in moving it, in sequence, to the upper side of the back face 
(V£ cw ), the left side of the upper face (V^ w ), the bottom side of the left face 
(D'^ l cw ), the left side of the upper face (D^^), the upper side of the back face 
(Vn CW ), the left side of the upper face (V„ ), and the front side of the upper 
face (Hq CW ). So the result is moving the cubie from the left side to the front 
side of the upper face. The cubie lying on the front side of the upper face initially 
lies in neither the V£ w nor the slice. So the moves in S before Hq W do 

no affect it. Applying the subsequence of moves starting at H§ w move it, in 
sequence, to the left side of the upper face (Hq W ) 7 the upper side of the right 
face (D^^), the left side of the upper face (Dg cw ), and the left side of the 
front face (V^ w )- So the result is moving the cubie from the front side of the 
upper face to the left side of the front face. The cubie lying on the back side 
of the upper face moves visits the back face (V^ cw , V^ w ) and the right side 
of the upper face (Hq W , Hq CW ), but is not affected by S, and the cubie lying 
on the right side of the upper face only visits the front side of the upper face 
{H§ w , H§ cw ) and thus is not affected by S. 

In summary, applying S to the cube results in changing the locations of 
exactly three cubies of a single cluster, those lying on the left and front sides of 
the upper face, and the cubie lying on the left side of the front face. All other 
cubies of the cube are left unchanged. The three affected cubies each move into 
the location of another, with the cubie on the left side of the upper face moving 
to the location of the cubie on the front side of the upper face, the cubie on 
the front side of the upper face moving to the location of the cubie on the left 
side of the front face, and the cubie on the left side of the front face moving to 
the location of the cubie on the left side of the upper face. As seen in Figure [8j 
the result of applying S to a cluster is to "rotate" the locations of three cubies, 
and in effect rotate the colors of the cubies at these three locations. Note that 
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Fig. 8. The resulting movement of the three cubies of a cluster resulting from applying 



the move sequence S = V£ 



_ ttCCW 
O Hq 



o H o L>„ 



oDi, 



(1 2 12) (4 3 10) (2 4 11) (3 1 9) (5 12 8) (20 13 19) 

(12 20 24) (13 5 4) (6 11 5) (19 14 18) (11 19 22) (14 6 3) 

(7 10 6) (18 15 17) (10 18 21) (15 7 1) (8 9 7) (17 16 20) 

(9 17 23) (16 8 2) (21 22 13) (24 23 15) (22 24 16) (23 21 14) 



Table 1. A set of 24 permutations that can be applied to a cluster while leaving all 
other clusters unchanged. 



rotating three elements is equivalent to performing a pair of swaps, just as the 
permutation (11 19 22) = (11 19)(11 22) 

The choices for which faces are front, left and upper are arbitrary, and there 
are 24 choices for such a set (six choices for the front face, and four choices for 
the upper face for each choice of front face). For a specific cluster, each choice 
of front, left and upper faces implies a permutation resulting from applying S. 
Using the ordering of the cubies in a cluster defined in Figure [6j a resulting set 
of 24 permutations is generated (as seen in Table [TJ. □ 

Lemma 9. Any cluster configuration with even parity can be solved using a 
cluster move solution of length 0(1). 

Proof. By Lemma[8j there exist a set of permutations that can be applied to any 
single cluster while applying the identity permutation to every other cluster. It 
can be shown using the GAP software package [27] that this set of permutations 
generates A 2 4, the set of even permutations on 24 elements. Thus any even 
permutation can be written as a composition of these permutations and has 
an inverse that can also be written as the composition of these permutations. 
Because each cluster has finite size, the inverse composition must have 0(1) 
length. So there exists a 0(l)-length sequence of moves that can be applied to 
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the cube that results in one cluster having the solved color configuration, and 
all other cubie clusters having unchanged color configurations. □ 

4.1 n X n X n Upper Bound 

As in the nxnxl case, our goal here is to find a way to solve several different 
clusters in parallel, so that the number of moves for the solution is reduced from 
0(n 2 ) to 0(n 2 / log n). We use the same parallelization technique as we did for 
the nxnxl Rubik's Cube, although some modification is necessary because of 
differences in the types of moves allowed. 

Lemma 10. Suppose we are given an n x n x n Rubik's Cube configuration and 
sets X,Y C {0, 1, ... , ln/2\ - 1} such that X n Y = 0. If all cubie clusters 
in X x Y have the same cluster configuration, then they can all be solved in a 
sequence ofO(\X\ + \Y\) moves that only affects cubie clusters in (X x7)U (X x 
X)U(YxY). 

Proof. Let d be the cluster configuration of all of the clusters in X x Y. By 
Lemma [9j we know that there is a constant-length cluster move solution for d. 
Let ax,..., a m , b\, . . . , b m , and c±, . . . , c m be the type sequences of that cluster 
move solution. Let xx,-..,Xt be the elements of X, and let yx,...,yk be the 
elements of Y. To build a sequence of moves to solve all the clusters in X x Y, 
we begin by defining: 

BuLKi = F ai , RC bi iXl , RC bi iX2 , . . . , RCf, itXe , RC Ci: y 1 , RC Cit y 2 , . . . , RC Ci ,y k . 

Note that this sequence consists of \X\ + |Y| + 1 moves. We then construct the 
full sequence of moves to be the following: 

BULKi, BULK 2 , . . . , BULK m . 

Because the original sequence of moves had length 0(1), we know that £ = O(l), 
and so the total length of this sequence will be 0(|X| + |Y| + 1). 

Consider the effect of this constructed move sequence on a cubie cluster 
[x, y) G {0, 1, ... , [n/2\ - 1} x {0, 1, ... , \n/2\ - 1}. First, consider the effect on 
(x,y) G X x Y. The subsequence of moves which affect this cubie cluster will 
be F ai , RCb llX ,RCc lt y, • • • j F ai , RCb e , x , RC ct . y . This is precisely the set of moves 
generated by the cluster move solution for solving the cluster (x, y) so this cluster 
will be solved. The subsequence of moves affecting the cluster (y, x) € Y x X. will 
be the same as the subsequence for the cluster (x, y). By Property [2] of cluster 
move solutions, this cluster will not be affected by the sequence of moves. 

We need not consider the effect on clusters XxX or YxY, because our lemma 
places no restrictions on what happens to those clusters. So all of the remaining 
clusters we must consider have at most one coordinate in X U Y. Suppose we 
have some x € X and some z £ X U Y. Then the sequence of moves affecting 
the clusters (x, z) and (z,x) will be F ai , RC bl . x , F a2 , RC b2tX . . . , F aei RC be , x - By 
Property [3] of cluster move solutions, this sequence of moves does not affect 
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any clusters, and so (x,z) and (z,x) will both remain unaffected. Similarly, 
suppose we have some y £ Y and some z ^ X U Y. Then the sequence of moves 
affecting the clusters (y, z) and (z, y) is F ai ,RC cl>v , F a2 ,RC C2iV . . . , F ae ,RC ce<y . 
According to Property [3j this move sequence does not affect the configuration 
of clusters (y, z) or (z,y). Finally, consider the effect on some cluster (w,z) £ 
X U Y x X U Y . Then the sequence of moves affecting (w, z) is F ai , F a2 , . . . , F ae . 
Once again, by Property[3]of cluster move solutions, this move sequence will not 
affect the configuration of cluster (w,z). □ 

Now say that we are given a cluster configuration d and a set of columns X 
and rows Y such that X C\Y = 0. Using the same row-grouping technique that 
we used for the n x n x 1 case, we show the following lemma. 

Lemma 11. Suppose we are given an n x n x n Rubik's Cube configuration, a 
cluster configuration c, and sets X, Y C {0, 1, . . . , \ n/2\ — 1}, such that £ = \X\ 
and X P\ Y = 0. Then there exists a sequence of moves of length 0{£2 l + \Y\) 
such that: 

— All cubie clusters (x,y) £ X x Y in configuration c will be solved. 

— All cubie clusters (x,y) £ (X x X) U (Y x Y) may or may not be affected. 

— All other cubie clusters will not be affected. 

Proof. For each row y £ Y , let S y = {x £ X \ cubie cluster (x, y) is in configura- 
tion c}. For each set S C X, let Y s = {y £ Y \ S y = S}. Because SCI, there 
are at most 2 £ different values for S. For each S, we will use the results of Lemma 



10 to construct a sequence of moves to solve each cubie cluster (x,y) £ S xY§. 
This move sequence will have length 0(15*1 + \Ys\) = 0(£+ \Yg\)]. When we sum 
up this cost over all sets S C X, we get the following number of moves: 

O (^£-2 e + J2\ Y s\ ] j =0(£-2 e + \Y\). 



□ 



Just as we did for the nxnxl Rubik's Cube, we avoid exponential blow-up 
by dividing the set of columns X into smaller groups, solving each such group 
individually. More formally: 

Lemma 12. Suppose we are given an n x n x n Rubik's Cube configuration, a 
cluster configuration c, and sets X, Y C {0, 1, . . . , \ n/2\ — 1}, such that X n Y = 
0. Then there exists a sequence of moves of length 0(\X\ ■ |Y"|/log |V|) such that: 

— All cubie clusters (x,y) £ X x Y in configuration c will be solved. 

— All cubie clusters (x,y) £ (X x X) U (Y x Y) may or may not be affected. 

— All other cubie clusters will not be affected. 



Proof. Let £ = \ log 2 \Y\, so that 2 e = y/\Y~\. Let k = \\X\/£]. Partition the set 
X into a series of sets Xj , . , . , Xf. each of which has size < £. For each X, , we 



Algorithms for Solving Rubik's Cubes 



21 



solve the cubie clusters in Xi x Y using the sequence of moves that is guaranteed 
to exist by Lemma The number of moves required to solve a single Xi is 

O (£2 e + \Y\) = O (Q log 2 \Y\\ y/\y\ + \Y\) = 0(\Y\). 

Therefore, if we wish to perform this for k sets, the total number of moves 
becomes 

□ 

To finish constructing the move sequence for the entire Rubik's Cube, we 
need to account for two differences between this case and the nxnxl case: the 
requirement that X n Y = and the potential to affect clusters in (X x X) U 
(Y x Y). 

Theorem 3. Given an n x n x n Rubik's Cube configuration, all cubie clusters 
can be solved in 0(n 2 j log n) moves. 

Proof. In order to solve the Rubik's Cube, we must first fix the parity. Using the 
techniques of Lemma[7| we can perform this step in 0{n) moves. Then we solve 
each edge cluster individually. Each edge cluster requires 0(1) moves to solve, 
and there are 0{n) edge clusters, so this preliminary step takes time 0(n). 

Once the edges have been solved, we want to solve the non-edge clusters. 
Let k = yJn/2. Partition {0, 1, ... , \n/2\ — 1} into a series of sets Gi,...,Gk 
each of which has size < k. For each pair i,j such that i ^ j and each cluster 



configuration c, we use the sequence of moves guaranteed to exist by Lemma 12 
to solve all (x, y) £ G{X Gj with the configuration c. This ensures that all cubie 
clusters (x,y) £ Gj x Gj will be solved. For each i, we must also solve all cubie 
clusters (x,y) £ G{ x d. There are k 2 (k — l)/2 such cubie clusters, so we can 
afford to solve each such cubie cluster individually. 

What is the total number of moves required? For a single pair i ^ j and 
a single configuration c, the number of moves required will be 0{k 2 / 'log k) = 
0(n/logn). There are a constant number of possible configurations, so solving 
a single pair i,j for all configurations will also require 0(n/ log n) moves. There 
are k 2 — k such pairs, for a total of 0(n 2 / logn). If we then add in the extra 0(k 3 ) 
from the diagonals, then the total number of moves will be 0(n 2 / logn + n 3 / 2 ) = 
0(n 2 /logn). □ 



4.2 n X n X n Lower Bound 

We derive a matching lower bound using a technique identical to the one used 
for the nxnxl lower bound: 

Theorem 4. Some configurations of an nxnxn Rubik's Cube are fi(n 2 j log n) 
moves away from being solved. 
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Proof. Lemma [9] shows that for every possible configuration of a cubie cluster, 
there exists a sequence of moves to solve the cubie cluster while leaving the rest 
of the cubies in the same location. Hence, the inverse of such a sequence will 
transform a solved cubie cluster to an arbitrary configuration without affecting 
any other cubies. Not counting the edges, there are ([n/2\ — l) 2 cubie clusters, 
each of which can be independently placed into one of (24!)/(4!) 6 different color 
configurations. This means that there are at least 

24 , x (L«/2j-i) 2 



(4!) E 

reachable configurations. 

There are 6n possible moves, so the total number of states reachable using 
at most k moves is at most 

- 1 < (6nr\ 
6n — 1 

Therefore, if k is the number of moves necessary to reach all states, it must have 
the property that 

/ 94! \ (L«/2J-1) 2 



V(4!) e 

(Ln/2J - if ■ log (J?L\ < log ((6n) fc+1 ) = (k + 1) log(6n), 

(Ln/2j-l) 2 log(^r) 
log(6n) 



- 1 < k. 



Hence, there must exist some configurations which are fi(n 2 / \ogn) moves away 
from solved. □ 



5 Optimally Solving a Subset of the n x n x 1 Rubik's 
Cube is NP-Hard 

In this section, we consider a problem which generalizes the problem of com- 
puting the optimal sequence of moves to solve a Rubik's Cube. Say that we are 
given a configuration of an n x n x 1 Rubik's Cube and a list of important cubies. 
We wish to find the shortest sequence of moves that solves the important cubies. 
Note that the solution for the important cubies may cause other cubies to leave 
the solved state, so this problem is only equivalent to solving annxnxl Rubik's 
Cube when all cubies are marked important. 

In this section, we prove the NP-hardness of computing the length of this 
shortest sequence. More precisely, we prove that the following decision problem 
is NP-hard: is there a sequence of k moves that solves the important cubies of 
the nxnxl Rubik's Cube? Our reduction ensures that the cubies within a single 
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cubie cluster are either all important or all unimportant, and thus it does not 
matter whether we aim to solve cubies (which move) or specific cubie positions 
(which do not move) . Therefore the problem remains NP-hard if we aim to solve 
the puzzle in the sense of unifying the side colors, when we ignore the colors of 
all unimportant cubies. 

Certain properties of the Rubik's Cube configuration can affect the set of 
potential solutions. For the rest of this section, we will consider only Rubik's 
Cubes where n is odd and where all edge cubies and cross cubies are both 
solved and marked important. This restriction ensures that for any cluster, the 
number of horizontal moves and vertical moves affecting it must both be even. 
In addition, we will only consider Rubik's Cubes in which all cubie clusters 
are in the cluster configurations depicted in Figures |l(a)| |l(b)[ and |l(d)[ This 



restriction means that the puzzle can always be solved using moves only of types 
Hi and V\ . This combination of restrictions ensures that each unsolved cluster 
must be affected by both vertical and horizontal moves. 

Suppose that we are given a configuration and a list of important cubies. Let 
u r be the number of rows of index < \ n/2\ that contain at least one important 
unsolved cubie. Let u c be the number of columns of index < \n/2\ that contain 
at least one important unsolved cubie. Then we say that the ideal number of 
moves for solving the given configuration is 2(u r + u c ). In other words, the ideal 
number of moves is equal to the smallest possible number of moves that could 
solve all the important cubies. An ideal solution for a subset of the cubies in a 
particular nxnxl puzzle is a solution for that set of cubies which uses the ideal 
number of moves. For the types of configurations that we are considering, the 
ideal solution will contain exactly two of each move, and the only moves that 
occur will be moves of type H± or V±. 

Definition 1. Let Ik(m) denote the index in the solution of the kth occurrence 
of move m. 

For our hardness reduction, we develop two main gadgets. The first gad- 
get forces an ordering on the second occurrences of row moves, and is used in 
the construction of the second gadget. The second gadget forces a betweenness 
constraint on the ordering of the first occurrences of row moves. 

Lemma 13. Given two sets of columns Xi, X 2 C {0, 1, . . . , \n/2\ — 1} ; there 
is a gadget using three extra rows and two extra columns ensuring that, for all 
X\ £ X\ and for all Xi £ X2, -^2(^1) < -^2(^2)- As a side effect, this gadget also 
forces 

max ii(xi) < min -^(^l) cind max Ii(x2) < min 12(^2)- 

116X1 iiEJfi x 2 £X 2 

Proof. Let 2/1,2/2,2/3 < L n /2J be three rows not used elsewhere in the construc- 
tion. Let x\,X2 < |_ n /2j be two columns not used elsewhere in the construc- 
tion. Make cubie clusters (x\, y%) and (£2,2/3) have the configuration depicted 



in Fig. 1(b) make cubie clusters and (£2,2/2) have the configuration 
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depicted in Fig. l(d)| and make cubie cluster (£2,2/1) remain in the solved con- 



figuration. Mark all of these cubic clusters as important. 

These cluster configurations enforce the following constraints: 

h(x{) < him) < I 2 (xi) < him), h{x 2 ) < him) < h{x 2 ) < him), 
him) < h(xi) < h{yx) < h{xi), him) < hix 2 ) < him) < hix 2 ). 

We can use these inequalities to construct the following chains: 

him) < hixi) < him) < hix 2 ) and him) < h(xi) < him) < hix 2 ). 

Because (£2,2/1) must remain solved, and because of the above constraints, there 
is only one possible ordering for the pairs of moves involving m and x 2 : him) < 
him) < hix2) < hix 2 ). If we combine this with the constraint hix 2 ) < 
him) < hix 2 ) < him), we know that him) < him) < him) < him)- 

Now, for each x\ G X\ and x 2 G X 2 , make cubie clusters (0:1,2/1) and ix 2 , m) 
have the configuration depicted in Fig. 1(b) | and mark them important. No 



other cubie clusters involving m,y 2 ,m 01 x i,%2 should be marked important. 
This constraint ensures that for all x\ G X\, hix\) must lie between him) and 
hijji)- In addition, for all choices of x 2 G X 2 , hix 2 ) must lie between him) 
and I 2 im)- Therefore, hixi) < hix 2 ). 

As a side effect, these constraints ensure that for all xi G X\, Ix(x\) must 
lie before him), while ^(^l) lies after him)- Therefore, 

max 7i(o:i) < min 7 2 (o:i)- 

116X1 116X1 

In addition, the constraints ensure that for all choices of x 2 G X 2 , Iiix 2 ) must 
lie before Iiim), while hix 2 ) lies after Iiim)- This ensures that 

max 11(0:2) < min hix 2 ). 

We have shown that these gadgets can enforce a constraint. We must also 
show that these gadgets do not enforce any constraints other than the ones 
expressed in the lemma. In other words, given any solution which satisfies the 
requirements given in the lemma, we must be able to insert the moves for our 
new rows and columns in such a way that all important clusters will be solved. In 
order to make sure that clusters (£1,2/2), ix 2 ,m), (£1,2/1), (£2,2/2), and (£ 2 ,2/i) 
are all solved, it is sufficient to ensure that the moves X\, x 2 , y\, 2/2, 2/3 occur in 
the following order: 

h im ) , h (£ 1 ) , h im ) , h (2/2 ) , h {x\ ) , h (x 2 ) , h (2/2 ) , h (2/3 ) , h (£2 ) , h im)- 

So if we can find the correct way to interleave this sequence with the existing 
move sequence, we will have a sequence that solves all clusters. 

First, we observe that the only important clusters in row y 2 and in columns 
£1 and £2 are the ones which the above sequence will solve. So we need only 
determine how to correctly interleave the moves for rows 2/1 and 2/3 with the 
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existing move sequence. We know from the statement of the lemma that the 
existing move sequence satisfies the following constraints: 

max I\(xi) < min ^(^l) < max I 2 (xi) < min ^2(^2)- 

n£Xi 116X1 x 2 £X 2 

So to ensure that each cluster (xi,yi) is solved, we insert the two copies of the 
move yi to satisfy: 

max h{xi) < h(yi) < min I 2 (xi) < max I 2 {xi) < h{yi) < min 72(^2)- 

I16X1 x-lEXx x 1 £X 1 x 2 £X 2 

Similarly, we know from the statement of the lemma that the existing move 
sequence satisfies these constraints: 

max Ii(x2) < min I 2 {x 2 ) < max 12(^2)- 

126-^2 I2€^2 I2£^ 2 

So we insert the two copies of the move 2/3 as follows, to ensure that each cluster 
(£2,2/3) is solved: 

max h(x 2 ) < hih) < min £2(212) < max I 2 [x 2 ) < him)- 

X 2 GX 2 I2GA2 I2€A 2 

To ensure that h(y~i) < hivi) < ^1(2/3) < -M2/3), we note that the above two 
constraints do not actually determine the ordering of I 2 (yi) and /i(j/3). So we 
can pick an ordering where I 2 {y\) < ^1(2/3), which will ensure that all clusters 
are solved. □ 



Lemma 14. Given three columns xi,x 2 ,X3 < \ n/2\, there is a gadget using six 
extra rows and two extra columns ensuring that Ii(x 2 ) lies between I\{x\) and 
7i(a;3). As a side effect, this gadget also forces I 2 (x 2 ) < I 2 (x\), I 2 {x 2 ) < I 2 (xs), 
and 

max Iii x ) < mm 7 2 (cc). 

x£{xi,x 2 ,x 3 } 16(11,12,13) 



Proof. Use a copy of the gadget from Lemma 13 to force I 2 (x 2 ) < I 2 (xi) and 
l2(x 2 ) < I 2 {x-$). Let 2/1:2/2,2/3 be three rows not used elsewhere in the construc- 
tion. Make each cubie cluster in the set 

{(2:1,2/2), (£1,2/3), (£2,2/1), (£2,2/3), (£3,2/1), (£3,2/2)} 



have the configuration depicted in Fig. 1(d) Make cubie clusters (21,2/1), (x 2 ,y 2 ), 



and (2:3,2/3) remain solved. Mark all of these cubie clusters important. Now we 
consider what form an ideal solution could take, given these restrictions. 

First, consider the case where 11(2:2) < /i(£i) and Ii(x 2 ) < 11(2:3). Because 
of the configurations of cubie clusters (2:2,2/1) arid (2:2,2/3), we know that 

h{vi) < h(x 2 ) < h(xi), h(y 3 ) < h(x 2 ) < h(x 3 ), 

h(yi) < h{x 2 ) < h(xi), ^2(2/3) < h{x 2 ) < I 2 (x 3 ). 
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Fig. 9. A sample of the betweenness gadget from Lemma [T4| Important cubies are red 
(solved) and blue (unsolved). Unimportant cubies are white. Any ideal solution must 
either have h(xx) < h(x2) < h(xs) or h(x3) < h(x2) < h(xi). 

Given these constraints, and the requirement that the cubie cluster (£1,2/1) re- 
main solved, it must be that I 2 (yx) < h{xx)- Similarly, because the cubie cluster 
(#3>J/3) must remain solved, it must be that 12(2/3) < Ix(x 3 ). In order to solve 
the cubie clusters (£1,2/3) and (£3,2/1), it must be that Ix(xx) < I 2 (y 3 ) an d 
h{x 3 ) < h{vi)- Therefore 

h{xx) < I 2 (y 3 ) < h(x 3 ) < I 2 {yi) < h(xi), 

a contradiction. Hence this case cannot happen. 

Now consider the case where Ii(xi) < Ix{x 2 ) and h(x 3 ) < Ix(x 2 ). Then we 
know that the following inequalities hold: 

h(xi) < h(x 2 ) < I 2 (x 2 ) < h(xi) and hfa) < h(x 2 ) < I 2 (x 2 ) < I 2 (x 3 ). 

Because of the configurations of cubie clusters (x 2 ,yi) and (x 2 , 2/3), this sand- 
wiching implies that 

h{x x ) < I 2 (yi) < h(xi), h(xi) < 12(2/3) < h(xi), 

h(x 3 ) < h{yi) < h{x 3 ), h(x 3 ) < ^2(2/3) < h{x 3 ). 

To ensure that cubie cluster (aci, j/i) still remains solved, we need Ii(xi) < Ii(yi). 
Given the configuration of cubie cluster (xx,y 3 ), we need Ii(y 3 ) < Ii(xi). To 
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ensure that cubie cluster (13,2/3) still remains solved, we need 71(2:3) < 71(7/3). 
Given the configuration of cubie cluster (2:3,2/1), we need Ii{yi) < Ii(x 3 ). Thus 

Mite) < h(xi) < h(Vi) < h{x 3 ) < Ii(y 3 ), 

a contradiction. Hence this case is also impossible. 

Because neither of the two cases are possible, Ii(x2) must lie between Ii{x\) 
and 11(0:3), which is precisely what we wanted this gadget to enforce. We must 
also show that this gadget does not enforce any constraints other than the ones 
expressed in the lemma. Given any solution which satisfies the requirements 
given in the lemma, we must be able to insert the moves for the new rows and 
columns in such a way that all important clusters will be solved. The extra 
constraints on the existing move sequence ensure the following: 

max Ii(x) < min 72(2;). 

16(11,13} x£{x!,x 3 } 

So we know that we can place the moves for the extra rows and columns used by 



the gadget from Lemma 13 We need only determine where to insert the moves 
for the three extra rows added by this gadget. 

The constraints given in the statement of the lemma allow for four different 
possible orderings of all of the x\, x%, x$ moves. We consider each case separately. 

1. I\{x\) < I\{x 2 ) < h(xa) < 12(2:2) < h(xi) < h(xa). Then we insert the 
moves ?/i,j/2, 2/3 so that the following is a subsequence of the move sequence: 

y2,y3,x 1 ,y 1 ,x 2 ,y 3 ,x 3 ,y 1 ,x 2 ,y2,x 1 ,x 3 . 

2. Ii(xi) < I\{x 2 ) < 71(2:3) < ^2(2:2) < ^2(2:3) < h{xi). Then we insert the 
moves 2/1,2/2, 2/3 so that the following is a subsequence of the move sequence: 

2/2, 2/3, xi, 2/1, x 2 , 2/3, x 3 , 2/1, x 2) 2/2, 2:3, xx 

3. Ii{x 3 ) < Ii(x 2 ) < Ii(xi) < I 2 (x 2 ) < h(xi) < I 2 {x 3 ). Then we insert the 
moves 2/1,2/2, 2/3 so that the following is a subsequence of the move sequence: 

2/i: 2/2, x 3 , 2/3, x 2 , 2/1, x x , 2/3, x 2 , 2/2, xi, x 3 

4. I\{x 3 ) < I\{x 2 ) < h(xi) < I 2 (x 2 ) < h{x 3 ) < I 2 (xi). Then we insert the 
moves 2/1,2/2) 2/3 so that the following is a subsequence of the move sequence: 

yi,y2,x 3 ,y 3 ,x 2 ,y 1 ,x 1 ,y 3 ,x 2 ,y 2 ,x 3 ,x 1 

□ 

The betweenness problem is a known NP-hard problem |8ll9j . In this problem, 
we are given a set of triples (a, 6, c), and wish to find an ordering on all items 
such that, for each triple, either a < b < c or c < b < a. In other words, for each 



triple, b should lie between a and c in the overall ordering. Lemma 14 gives us a 



gadget which would at first seem to be perfectly suited to a reduction from the 
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betweenness problem. However, because the lemma places additional restrictions 
on the order of all moves, we cannot reduce directly from betweenness. 

Instead, we provide a reduction from another known NP-hard problem, Not- 
All-Equal 3-SAT [8 23 . In this problem, sometimes known as 7^-SAT, the input 
is a 3-CNF formula <j) and the goal is to determine whether there exists an 
assignment to the variables of <fi such that there is at least one true literal and one 
false literal in every clause. Our reduction from ^-SAT to ideal Rubik solutions 
closely follows the reduction from hypergraph 2-coloring to betweenness [!§] . 

Theorem 5. Given a ^-SAT instance 4>, there exists annxnxl configuration 
and a subset of the cubies that has an ideal solution if and only if<fi has a solution, 
i.e., belongs to ^-SAT. 

Proof. We start with a single column r center < [n/2\ . For each variable in 0, 
we construct two columns s Xi ,s^r < [n/2\. We then add a copy of the gadget 



from Lemma 14 to ensure that Ii(r center ) lies between Ii(s Xi ) and for 
each value of i. For each clause Cj — y\ V 1/2 V 2/3, we add a new column tj. Then 
we add a copy of the gadget from Lemma 14 to ensure that I\ (tj ) lies between 



Ii(s yi ) and Ii(s y2 ). Then we add one more copy of the gadget from Lemma 14 
to ensure that Ii(r cente r) lies between I\(tj) and Ii(s y3 ). 



Note that the additional constraints forced by the gadget from Lemma 14 
mean that all of the following inequalities must hold: 

h{r C enter) < h(s Xi ), h(r center) < h(Smr), 

h {tj ) < h (%! ) , h (tj ) < h (s yi! ), 

Il(Tcenter) < ^(tj), I 2( T center) < Il(Sy?)- 

We can satisfy all of these constraints by first performing the second move of 
t 'center , then performing the second moves of all of the t columns, and finally 
by performing the second moves of all of the s columns. The other constraint 
imposed by the gadget from Lemma[l4]can be satisfied by dividing up the moves 
into two sequential stages such that all of the variable, clause, and center moves 
are performed exactly once per stage. Hence, the additional constraints enforced 
by our gadget do not affect our ability to construct an ideal solution, as long as 
is a member of ^-SAT. 

To see why this reduction works, suppose that is a member of ^-SAT. We 
must convert an assignment to the variables of cj> to an ideal solution to the subset 
of the Rubik's Cube constructed above. As noted in the previous paragraph, we 
can choose an ordering of all of the second moves that satisfies the gadgets we 
have constructed. To arrange the first moves of all of the columns, we pick an 
ordering of the columns s corresponding to literals so that Ii(s y ) < l\(r center) 
for all true literals y and ii(r oenter ) < Ii(s z ) for all false literals z. The ordering 
of the literals themselves does not matter, and can be picked arbitrarily. This 
arrangement ensures that for each Xi, we have either Ji(s Xj ) < l\(r center) < 
h(sx-), or h(sx-) < l\(r center) < h(s Xi )] either way, there will be a way to 
correctly arrange the extra columns and rows used by the gadget from LemmapT] 
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We then must pick times for the first move for each column tj. Let Cj — 
Di V 2/2 V 2/3 be the clause corresponding to the column we are considering. 

1. Consider the case where 2/1 and 2/2 are both true. Then we pick an arbitrary 
location for tj between s Vl and s V2 , so that there is a way to correctly arrange 
the extra columns and rows used by the copy of the gadget that ensures that 
I\{tj) lies between Ii(s yi ) and Ii(s V2 ). This means that h(tj) < I\(r center)- 
Because 2/1 and 2/2 are both true, then 2/3 must be false, and so 2i(r cen { er ) < 
11(2/3). Hence, there will be a way to correctly arrange the extra columns 
and rows used by the copy of the gadget from Lemma |14| 

2. Now consider the case where 2/1 and 2/2 are both false. Then we pick an 
arbitrary location for tj between s yi and s V2 . Just as before, this satisfies 



the gadget from Lemma 14 which forces I\(tj) to lie between I\(s Vl ) and 
I%(sy 2 ). This also means that I\(r C enter) < h(tj). Because both 2/1 and 2/2 
are false, 2/3 must be true, and therefore /1 (2/3) < Ii(T center ) < I%(tj). So 
once again, the gadget is satisfied. 
3. Now consider the case where 2/1 is true and 2/2 is false, or vice versa. Then 
2/3 is either true or false. If 2/3 is true, then 1 1(1/3) < h(r center), and so 
we pick the location for I\(tj) to be just slightly larger than I\(r ce nter), 
so that /i(2/3) < h(r center) < h(tj) and h(tj) lies between Ii(s Vl ) and 
I%(sy 2 ). Similarly, if 2/3 is false, then l\(r center ) < ^1(2/3), and so we pick 
the location for I\(tj) to be just slightly smaller than Ii(r cen ter), so that 
h(tj) < h(r C enter) < h(y3) and h(tj) lies between h(s Vl ) and h(s y2 ). In 
either case, the location for the first move tj will satisfy both of the gadgets 
created using Lemma [14} 

So if we have a solution to 0, then we also have an ideal solution. 

Now we wish to prove the converse. Suppose that we have an ideal solution. 
Then we construct a solution to cj) by setting Xi to be true if and only if 1\ (s Xi ) < 
h(r center)- Because of the gadgets that we constructed, we know that all true 
literals y have the property that I\(s y ) < I\(r 'center) , while all false literals z have 
the property that h(r C enter) < h(s z )- To see why this works, consider a clause 
Cj = 2/1 V 2/2 V 2/3. Assume, for the sake of contradiction, that all three literals 
in the clause are true. Then Ii(s Vl ), I%(s y2 ), Ix(s y3 ) < h(r C enter)- Because our 
betweenness gadget is working correctly, we know that h(tj) < Ii(r center), as 
well. This means that I\(r C enter) does not lie between I%(tj) and Ii(s y3 ), which 
means that our solution could not have been ideal. So our assumption must be 
wrong, and not all of the literals in the clause are true. A similar argument shows 
that not all of the literals in the clause are false. This means that the clause has 
at least one true literal and at least one false literal, and so is a member of 
7^-SAT. □ 



6 Optimally Solving an 0(1) X 0(1) X n Rubik's Cube 

For the ci x C2 x n Rubik's Cube with c\ 7^ n 7^ c%, the asymmetry of the puzzle 
leads to a few additional definitions. We will call a slice short if the matching 
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coordinate is z; otherwise, a slice is long. A short move involves rotating a short 
slice; a long move involves rotating a long slice. We define cubie cluster i to be 
the pair of slices z = i and z = (n— 1) — i. If n is odd, then cubie cluster (n— 1)/2 
will consist of a single slice. This definition means that any short move affects 
the position and orientation of cubies in exactly one cubie cluster. 

Lemma 15. Given any c\ x C2 x n Rubik's Cube configuration, and any cubie 
cluster t, the number of short moves affecting that cubie cluster in the optimal 
solution is at most 2 2c i C2 + 8 ( c i+ C2 ) - 1. 

Proof. Consider the subsequence of moves in the optimal solution which affect 
t. This should contain all of the long moves, and only those short moves which 
rotate one of the two slices in t. For notation purposes, we merge all consecutive 
long moves together into compound long moves Lo, . . . , Lk, so that the sequence 
of moves is L ° Si o L\ o s 2 o . . . o Lk-i o s fe o For convenience, we define 
s to be the identity function, so that we can write the sequence of moves as 
so Lq o si o Li o s 2 ° . . . ° S& o Lk- 

We define the following to be the results of performing certain moves: 

Seq(i, j) = Si o Li o s.i + i o Lj_|_i o . . . o Sj—i o Lj-i o Sj o Lj, 
LSeq(i, j) — Li o Lj + i o . . . o Lj-i o Lj. 

Assume for the sake of contradiction that there exist i < j such that Seq(0, i) o 
LSEQ(i+l,fc) = Seq(0, j)oLSEQ(j+l, k). Composing by the inverse of LSeq(j + 
1, fc), we obtain Seq(0, i) o LSeq(z + = Seq(0, j). Therefore 

SEQ(0,i) o LSeq(i + 1, j) o Seq(j + l,fc) = Seq(0, j) o Seq(j + l,fc) 

= SEQ(0,fc). 

Hence we can omit the moves s i+1 , . . . , Sj while still ending up with the cor- 
rect configuration for t. This means that there exists a sequence of moves, 
shorter than the original, which brings the Rubik's Cube into the same configu- 
ration. But the original sequence was optimal. Therefore, our assumption must 
be wrong, and Vi < j, Seq(0,z) o LSEQ(i + 1, k) ^ Seq(0, j) o LSeq(j + 1, fc). 

In both Seq(0, i) o LSeq(i + 1, fc) and Seq(0, j) o LSeq(j + 1, fc), the set of 
long moves is the same, so the configuration of all cubie clusters other than t 
must also be the same. Therefore, the results of those moves must differ in the 
configuration of the cubie cluster t. The cubie cluster with the greatest number 
of configurations is the cubie cluster which contains the ends of the Rubik's Cube 
(i.e., z = and z = n — 1), because of the additional information given by the 
exposed sides of the cube. For that cubie cluster, the total number of different 
configurations is < 2 2ciC2 ■4 4(ci+C2 '> = 2 2ciC2+8 ( Cl+c ^ . Each short move affecting t 
in the optimal solution must lead to a new configuration of t, and so the number 
of short moves must be < 2 2ciC2+8 ( Cl+C2 ) - 1. □ 

Lemma 16. There exists a sequence of long moves t\ o £ 2 ° ■ ■ ■ f-m, where 
m < (cic 2 )! • 2 1+ClC2 ; such that: 
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1. l\ o £ 2 o . . . o l m is the identity function; and 

2. for every long sequence L, there exists some i such that ^o^ +1 o. . .o£ m = L. 

Proof. Every sequence of long moves causes a rearrangement of the cubies in the 
Rubik's Cube. However, there are no long moves which break up the boxes of 
cubies of size 1 X 1 X n — each box can be moved or rotated, but the relative 
positions of cubies within the box are always the same. There are c\Ci such 
boxes, and each box can be oriented in two ways. This means that there can be 
no more than (C1C2)! • 2 ClC2 reachable long configurations. We treat these long 
configurations as a graph, with edges between configurations that are reachable 
using a single long move. If we take a spanning tree of the graph and duplicate 
all edges of the tree, then we can find an Eulerian cycle which visits all of the 
nodes in the graph. If we start at the identity configuration and move along the 
cycle, then we have a path of < (C1C2)! • 2 1+ClC2 long moves satisfying both of 
the above properties. □ 

Lemma 17. Given any C\ x c 2 x n Rubik's Cube configuration, the number of 
long moves in the optimal solution is at most (C1C2)! • 2 1+3ciC2+8 ( Cl+C2 ) . 

Proof. Assume, for the sake of contradiction, that the number of long moves 
in the optimal solution is greater than (c\c 2 )\ ■ 2 1+3ciC2+8 ( Cl+C2 ). Then we can 
construct another solution with the same number of short moves as the optimal, 
and fewer long moves. Let £\o£ 2 o. ■ .°i m be the sequence of long moves satisfying 
Lemma [IB] Let L be the sequence of long moves in the optimal solution, and let 
i be the index such that li o £ i+1 o . . . o£ m = L. We choose the sequence of long 
moves in our constructed solution to be 

{li o £ l+1 o . . . o £ m ) o(< 1 of 2 o...o £ m ) o...o(<iof 2 o...o £ m ) . 

" v " 

2 2c 1 c 2 +S(c 1 +c 2 ) _ 1 timcs 

This long move sequence has the same result as L. We must also specify how to 
interleave the short moves with the long move sequence. For a fixed long move 
sequence, the arrangement of short moves for one cubie cluster does not affect 
any other cubie cluster. Consequently, if we can correctly interleave the short 
moves for one cubie cluster, we can correctly interleave the short moves for all 
cubie clusters. 

Pick an arbitrary cubie cluster. Consider the subsequence of moves in the 
optimal solution which affect said cubie cluster. For notation purposes, we merge 
all consecutive long moves together into compound long moves Lq, . . . , Lk, so 
that the sequence of moves is Lq o si o L\ o s 2 ■ ■ • ^fe-i Sk ° Lk- We place 
the short moves into the above sequence of long moves starting with Sk ■ Let a 
be the index such that £ a o . . .0 £ m = Lk- We insert Sk into the fcth repetition of 
(£10 . . .0 £ m ) between l a -\ and £ a . This ensures that the sequence of long moves 
occurring after k will be equivalent to Lk- 

In general, say that we have placed Sj+i, . . . , in the (i + l)st through kih 
repetitions of (£\ o ... o £ m ). Say we want to place Si in the ith repetition. Let b 
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be the index such that Sj+i was placed between 4-i and 4- Let a be the index 
such that: 

£ a o...o l m =L i o(£ 1 o...o t h _ x )- 1 = U o o q\ o...ot- x o 

This ensures that the sequence of long moves between Si and s; + i will be: 
4 ° • • • ° 4n ° 4 ° • • • ° 4-i =L i o(£ l o...o 4-i) _1 ° (4 ° • • • 4-l) = Li 

Hence, if the number of long moves in the optimal solution is greater than 
(C1C2)! • 2 1+3ciC2+8 ( Cl+C2 ), we can create a solution with the same number of 
short moves and fewer long moves. This means that we have a contradiction. □ 

Theorem 6. Given any c\ x ci x n Rubik's Cube configuration, it is possible to 
find the optimal solution in time polynomial in n. 



Proof. By Lemma 17 we know that the total number of long moves in the 
optimal solution is at most (C1C2)! • 2 1+3ciC2+8 ( Cl+C2 ', which is constant. We also 
know that there are a total of C\ + possible long moves. Hence, the total 
number of possible sequences of long moves is constant, so we can enumerate all 
of these in time O(l). 

For each of the sequences of long moves, we want to find the optimal solution 
using that sequence of long moves. Because the long moves are fixed, we can 
calculate the short moves for each cubie cluster independently. To calculate the 
short moves for some fixed cubie cluster, we note that between two sequential 
long moves, there are at most four different ways to rotate each of the two slices 
in the cubie cluster, for a total of at most sixteen possible combinations of short 
moves. 

For a given cubie cluster, we have to consider < 16 1+ ( ClC2 ) ! ' 2 12 12 
possible combinations. This is constant, so we can try all possibilities to see if 
they solve the cubie cluster. We can pick the shortest of those. If we perform 
this operation for all cubic clusters, we will have an optimal solution for this 
particular sequence of long moves. If we calculate this for all sequences of long 
moves, then we can pick the overall optimal solution by taking the sequence of 
minimum length. □ 



7 Conclusion and Open Problems 

In this paper, we presented several new results. First, we introduced a technique 
for parallelizing the solution to two types of generalized Rubik's Cubes. As a 
result, we showed that the diameter of the configuration space for these two 
types of Rubik's Cubes is 0(n 2 /logn). In addition, we showed that it is NP-hard 
to find the shortest sequence of moves which solves a given subset of the cubies 
in an n x n x 1 Rubik's Cube. Finally, we showed that there exists a polynomial- 
time algorithm for solving Rubik's Cubes with dimensions c\ x C2 x n, where 
d^n^ c 2 . 
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Our results leave several questions open. The most obvious questions concern 
the NP-hardness result: whether it can be modified to show the NP-hardness of 
optimally solving the whole n x n x 1 Rubik's Cube, and whether it can be fur- 
ther modified to show the NP-hardness of optimally solving the whole nx nx n 
Rubik's Cube. The other questions concern approximation algorithms. In par- 
ticular, is there a constant-factor polynomial-time approximation algorithm for 
finding an approximately optimal solution sequence from a given configuration? 
The analogous question for the n 2 — 1 puzzle has a positive answer [ST]. The 
parallelism techniques we introduced for the diameter results seem to be central 
to developing such an approximation algorithm. 
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